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ABSTRACT 


Analysts  conducting  experiments  must  frequently 
deal  with  situations  in  which  data  is  incomplete  or 
missing.  This  creates  problems  that  can  seriously 
affect  classical  hypothesis  testing  by  introducing 
extraneous  terras  into  the  hypothesis  in  a  complicated 
way.  A  technique  exists  that  allows  an  analyst  to 
determine  precisely  which  experimental  terms  are 
actually  present  in  a  proposed  hypothesis  and  what 
that  hypothesis  would  actually  be  testing  if  employed. 
This  paper  examines  the  mathematics  underlying  the 
technique  and  applies  the  theory  to  a  widely  used  data 
analysis  computer  package.  A  computer  program  is 
presented   to  facilitate  implementation  of  the  method. 
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I.   EXECUTIVE  SUMMARY 


A.   PR03LEM 


At  times  even  the  most  carefully  designed  and  executed 
experiments  can  be  plagued  with  aborted  tests  or  missing 
data.  Such  unbalance  in  the  data  can  have  a  significant 
impact  upon  the  mathematical  structure  of  analytic 
techniques  used  in  analysis  of.  variance.  In  addition  to 
increasing  the  complexity  of  computations,  unbalanced  design 
can  also  seriously  affect  hypothesis  testing.  3ecause  of 
lack  of  balance,  hypotheses  purporting  to  test  the  influence 
of  a  main  effect,  for  example,  may  be  hopelessly  confounded 
with  interaction  terms.  Blindly  "testing"  such  confounded 
hypotheses  without  an  appreciation  of  the  level  of  pollution 
from  extraneous  terras  can  lead  to  serious  error  in 
interpreting  results.  It  is  desirable  to  find  a  general 
procedure  for  use  with  analysis  of  variance  that  can 
determine  exactly  what  a  proposed  hypothesis  is  testing  in 
terms  of  the  main  effects  and  interactions. 


B.   APPROACH 


Because  of  its  mathematical  power  and  notational 
simplicity,  the  matrix  form  of  the  linear  model  Y  =  Xb  +  e 
is  used  in  deriving  a  solution  to  the  problem.  The  linear 
model   leads  to  the  "normal  equations"  X'Xb  =  X'Y.  Since  X'X 


0 
is  in  general  not  of  full  rank,  any  solution  (b  )  for   b   is 

-1 

not  unique.  Further  (X'X)    does  not  exist;  one  must  turn  to 

the  concept  of  a  generalized  inverse  G  of  X'X.  It  can  be 
shown    that   testing   a   hypothesis   H  :  q'b  =  m   involves 

expressing  the  hypothesis  as  a  linear  function  q'GX'X  of  the 

generalized  inverse  (G)  and  X'X.  While  determination  of 
g'GX'X  is  frequently  a  non-trivial  manual  calculation,  it 
can  be  handled  easily  on  a  computer. 


C.   SOLUTION 


If  an  analyst  needs  to  test  a  particular  hypothesis  it 
is  possible  that  additional,  undesired  terms  may  be 
polluting  the  hypothesis  to  such  a  degree  that  his 
interpretation  of  test  results  may  be  completely  invalid.  By 
computing  the  value  of  g'GX'X  he  will  be  able  to  determine 
precisely   what  his  proposed  hypothesis  is  actually  testing. 


CONCLUSIONS 


Recognizing  that  an  unbalanced  design  can  lead  to 
difficulty  in  interpreting  traditional  tests  of  hypotheses, 
it  is  concluded  that: 

1 .  it  is  mathematically  possible  to  determine  the  exact 
nature  of  a  croposed  hypothesis,  and 

2.  such  a  determination  is  feasible  using  a  computer. 


II.   MATHEMATICAL  JUSTIFICATION 


A.   GENERALIZED  INVERSE 


A  generalized  inverse  of  a  matrix  A  is  defined  to  be  any 
matrix  G  satisfying 

AG  A  =  A. 
It  can  be  shown  that,  for  a  given  matrix  A,  G  is  in   general 
not  unique  [Searle,  1971]. 


B.   SOLUTION  OF  CONSISTENT  LINEAR  EQUATIONS 


The  system  of  linear  equations  AX  =  Y  is  consistent  if 
any  linear  relationships  existing  among  the  rows  of  A  also 
exist  among  the  corresponding  elements  of  Y.  Since  linear 
equations  have  a  solution  if  and  only  if  they  are 
consistent,  the  procedures  outlined  below  are  confined  to 
systems  of  consistent  linear  equations. 

The  following  theorems  from  Searle  [ 1  ]  are  stated 
without  proof  in  order  to  develop  solution  procedures  for 
consistent  equations. 

Theorem   1.   Consistent   equations   AX  =  Y   have   a  solution 
X  =  GY  if  and  only  if  AGA  =  A. 


Theorem   2.   if   A   has   q  columns  and  if  G  is  a  generalized 
inverse  of  A,  then  the  consistent  equations  AX  =  Y  have   the 


solution 


0 
X   =  GY  +  (GA  -  I)  Z 


where  Z  is  any  arbitrary  vector  of  order   q.   The   notation 

0 

indicates  that  X  ,  which  satisfies  AX  =  Y,  is  a  solution  and 

not  the  general  vector  of  unknowns  X. 


Theorem    3.    For   the   consistent   equations   AX  =  Y,   all 

solutions   are,    for    any    specific    G,    generated   by 

0 

X   =  GY  +  (GA  -  I)  Z,   for   arbitrary   Z.   That   is,  one  need 

derive  only  one  generalized  inverse  of  A  in  order  to  be  able 
to  develop  all  solutions  to  the  system  AX  =  Y. 


C.   THE  SPECIAL  CASS  OF  SYMMETRIC  MATRICES 


The  linear   model   used,   inter   alia,   in   analysis   of 
variance  involves  the  system  of  consistent  linear  equations 

X'Xb  =  X'Y 
that  are   solved   for   b.   It   is   therefore   worthwhile   to 
consider   the   special   case   of  the  symmetric  matrix  X'X  in 
some  detail.   The  following  development  is  from  Searle  [  1  ]. 

Lemma  1.  X'X  =  0  implies  X  =  0. 

Proof:  This  is  true  because  if  X'X  =  0,  the  sums  of   squares 

of  the  elements  of  each  row  equal  zero,  hence  must  be  zero. 

Lemma  2.  PX • X  =  QX'X  implies  that  PX'  =  QX'. 
Proof:  Apply  Lemma  1  to  the  identity 

(PX'X-QX'X)  (P-Q)  '  =  (PX'-QX')  (PX'-QX')  »  =  0. 
That  is,  (PX'-QX')  (PX'-QX')'  =  0  implies  that   (PX'-QX')  =  0 
which  implies  that  PX •  =  QX ' . 
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Theorem  4.  When  G  is  a  generalized  inverse  of  X'X,  then 
i.  G1  is  also  a  generalized  inverse  of  X'X; 
ii.  XGX'X  =  X  (i.e.,  3X'  is  a  generalized  inverse  of  X) ; 
iii.  XGX'  is  invariant  to  G. 
Proof  : 

(i)  By  definition 

X' XGX'X    =    X'X 
transposing  X'XG'X'X   =    X'X         establishing    (i) . 


(ii) 

by  Lemma  2 
transposing 


X'XG'X'X  =  X'X 

X  '  XG  '  X  '  =  X  ' 

XGX'X  =  X      establishing  (ii) 


(iii)  Suppose  F  is  some  generalized  inverse  different 
from  G.  Then  XGX'X  =  X  =  XFX'X.  3y  Lemma  2  XGX'  =  XFX'. 
That  is,  XGX'  is  the  same  for  all  generalized  inverses  of 
X'X,  establishing  (iii). 


D.   THE  LINEAE  MODEL 


The  general  linear  model  is  Y  =  Xb  +  e  where  Y  is  an 
n  x  1  vector  of  observations  whose  components  are  random  and 
observable;  X  is  a n  n  x  p  matrix  of  experimental  design 
whose  components  are  real  and  known;  b  is  a  p  x  1  vector  of 
parameters  whose  components  are  real  and  unknown;  e  is  an 
n  x  1  vector  of  experimental  error  whose  components  are 
random  and  unobservable.  The  vector  e  is  defined  as 

e  =  Y  -  E(Y) 

E(e)  =  S(Y)  -  E(S(Y))  =  0, 
and  E(Y)  =  E  (Xb)  +  E(e) 

E(Y)  =  Xb. 

2 
Every  element  in  e  is  assumed  to  have  the  same   variance   v 
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and   zero   covariance   with   every   other  element,  thus  e  is 

2  2 

distributed  (0,v  I)  and  Y  is  distributed  (Xb,v  I).   Deriving 

the  normal  equations  for  the  linear  model  yields 

X'Xb  =  X'Y 

0 
which   can   be   solved   for   b    using   the   techniques    of 


generalized  inverses  described  earlier,  i.e., 

0 


and 


wnere 


b   =  GX'  Y 

0 
E  (b  )  =  E  (3X»Y) 

=  GX'  2  (Y) 

=  GX' Xb 
=  Hb 
H  =  GX'X. 
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III.   THE  CONCEPT  OF  ESTIMABILITY 


A.   ESTIMABILITY 


As  defined  by  Searle  [ 1  ] ,  a  linear  function  q'b  of  the 
parameters  in  b  is  estimable  if  it  is  equal  to  any  linear 
function  [t'E(Y)]  of  the  expected  value  of  the  ooservations 
in  Y.  It  is  important  to  note  that  t1  is  not  in  general 
unique;  -he  only  requirement  for  estimability  is  that  such  a 
vector  exist. 


B.   PROPERTIES 


The    definition   of    estimability    leads 
mathematical  properties  of  immediate  importance: 


to 


four 


(1)  The  expected  value  of  any  observation  is  estimable. 
In  this  case  t1  is  a  vector  with  a  single  element  equal  to 
one;  the  rest  of  its  elements  are  zero. 

(2)  Any   linear   combination   of  estimable  functions  is 

estimable.   If  q'b  and  r'b  are  estimable,  then  q'b  =   t'E(Y) 

and   r'b   =   s'E(Y).   Therefore   c  q'b   +   c  r'b   =   (c  t'  + 

1  2  1 

c  s')E(Y)  which  is  estimable. 
2 

(3)  An  alternative  form  of  the  condition  of  estimability 
can  be  developed  as  follows.  If  q'b  is   estimable,   then   by 
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definition  q'b  =  t*E(Y)  hence  g'b  =  t'Xb.  This  must  hold  for 
all  values  of  b  since  the  condition  of  estimability  does  not 
depend  on  a  specific  choice  of  b.  This  leads  to  the  result 
q<  =  t'X. 


0  0 

(4)   When   g'b    is   estimable,  g'b   is  invariant  to  the 

0 
solution  used  for  b   because 

0       0 
g'b   =  t' Xb   =  t 'XGX' Y. 

0 
Since    by   Theorem   4,   XGX*   is   invariant   to   G,  g'b   is 

0 

invariant  to  G  and  therefore  to  b   when   g'b   is   estimable. 

Herein   lies   the   essential  importance  of  estimability:   if 

0 
g'b  is  estimable,  q'b   has  the  same  value  for  all   solutions 

0 
b  . 


That  is,  an  estimable  function  is  a  linear  function  of 


the  parameters  that  is  invariant   to   whatever   solution   is 
0 


used  for  b  . 


C.   THE  TEST 


A  function  g'b  is  estimable  if  there  exists  some  vector 
t'  such  that  g'  =  t'X.  Finding  such  a  vector  t1  may  be  a 
formidable  task  with  a  design  of  large  dimensions.  As  an 
alternative,  it  is  possible  to  test  for  estimability  by 
determining  if  g'H  =  g' .  Searle  [1]  shows  that  g'b  is 
estimable  if  and  only  if  g'H  =  g',  as  follows. 
If  g'b  is  estimable 

g'  =  t'X 

g'H  =  t'XH 

g'H  =  t'XGX'X 
by  Theorem  4  GX'  is  a  generalized  inverse  of  X, 
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hence  g'H  =  t'X 

g'H  =  q1 . 

On  the  other  hand,  if 

q'  =  g'H, 
g'  =  g'GX'X 

and  g'  =  t'X  for  t' 


q'GX' 


D.   THE  CONSTBAINED  MODEL 


1  •   ^§Z^i°£iE§D.l 

The  normal  eguations  X'Xb  =  X'Y   form   a   consistent 

system   of  linear  equations  where  X  is  of  rank  r<p.   Because 

X'X  is,  in   general,   not   of   full   rank,   there   are   many 

solution   vectors  that  will  satisfy  the  system.   In  order  to 

0 
obtain  a  particular  solution  b  ,  additional   constraints   or 

the   form   Cb  =  0   are  often  added  to  the  model.   A  commonly 
used  set  of  constraints  satisfies  the  restrictions 

*  the  main  effects  sum  to  zero 

*  the  interaction  effects  sum   to   zero   across   each 
subscript. 

Adding  the  constraints  Cb  =  0,  where  the  (p-r)  rows 
of  C  are  linearly  independent  of  the  rows  of  X,  yields  the 
following  system  of  linear  equations: 


b  + 


The  constraint  matrix  C  can  be  used  to  transform  the  design 
matrix  X  into  a  constrained  matrix  X  by  performing  basic 
row  operations  on  the  system  of  linear  equations. 
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1-6 


X 

c 


b  + 


1-6-1 


Note  that  X   is  the  same  size   (n   x   p)   as   X;   b   remains 

unchanged.  The  practical  effect  of  introducing  the 
constraints  into  the  design  matrix  is  to  make   some   of   the 

columns   of   X   consist   entirely  of  zeros.  While  b  remains 

unchanged,  the  transformation  of  X  into  X  has  the  effect  of 
"deleting"  some  elements  of  the  parameter  vector  by  the 
mechanism  of  creating  those  columns  of  zeros.  Once  the 
constraints   have   been   integrated   into  the  design  matrix, 

transforming  X  into  X  ,  the  constraints  become  redundant  and 

can   be   removed   from  the  model  by  the  following  technique. 

Let  A  oe  a  (n,n+p-r)  matrix  such  that 

A    =  0  if  i  #  j 
ij 

A    =  1  if  i  =  j. 

ij 

Then  multiplying  by  A, 


A  .  .  .   =  A 

0 


X 

c 


b  +  A 


e 
0 


yields   the   constrained   linear  model  Y  =  X  b  +  e,  which  is 
equivalent  to  the  constrained  system  above. 

2 
Since   e   is   assumed   normal   (0 ,  v  I)  ,   Y   is  also 

normal;  E(Y)  =  X  b.  The  normal  equations,  X  'X  b  =  X  'Y,  can 

0 
be   solved   for   a   particular   solution   b    that  will  also 

satisfy  the  original  normal  equations  X'Xb  =  X'Y.  If   G    is 

*   *        0     *  * 
defined  as  the  generalized  inverse  of  X  ' X   then  b   =  G  X  'Y 
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and  it  follows  that  G  X  •  is  a  generalized   inverse   of   X  . 
Let  H   =  G  X  "X  . 


It  is  stressed  that  this   constrained   linear   model 

was   developed   solely   for   the   purpose   of   finding   some 

0 
particular  solution  vector  b    to   the   original   system   of 

normal   eguations.  In  the  discussion  that  follows,  X   is  the 

same  size  as  X  and  the  parameter  vector  b  is  the  same  in  the 
constrained  linear  model  as  it  was  in  the  original  linear 
model  Y  =  Xb  +  e. 


2 .   exa m£le 


As  a  simple  example  of  the  development  of  X  ,  assume 
that  tne  design  matrix  X  is 

10         0  1 

10         10 

110         0 
Let    C    be 

0  111. 


Then 


X     |    is 

..  I 
C     I 


10         0  1 

10  10 
110  0 
0  111 

Subtracting  the  bottom  row  from  the  top  row  yields 

1-1-1  0 
10  10 
110  0 
0    111 
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which  is 


X 

C 


The  appropriate  A  matrix  is 

10  0  0 
0  10  0 
0    0    10 

Multiplying  yields  X 

1-1-1  0 
10  10 
110    0 


E.   BIOMEDICAL  COMPUTER  PROGRAMS 


The  University  of  California  publishes  and  maintains  the 

BIOMED   series  of  standard  data  analysis  packages  for  use  on 

digital  computers  [Dixon,  1976].  One  of  the  programs   within 

the   package,   BMD05V,  performs  computations  for  analysis  of 

variance  with  the   linear   statistical   model.    The   design 

matrix   employed  is  not  the  same  as  the  design  matrix  (X)  in 

that   model   however.   A   user   of   3MD05V   is   reguired   to 

introduce    appropriate   additional   constraints   to   permit 

0 
computing  a   particular   solution   (b  )   for   the   parameter 

vector.  It  will  be  shown  that  technigues  applicable  to  the 
design  matrix  X  in  the  general  linear  model  can  be  applied 
directly  to  the  BMD05V  design  matrix. 
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F.   APPLICATION  TO  BMD057 


The  constraints,  Cb  =  0,  added  to  the   linear   model   in 
section   D   above   are   the  type  used  to  generate  the  BMD05V 

* 
design  matrix.   The  resulting  matrix  X   has  the  same   number 

of  columns  as  the  original  design  matrix  X,  but  because  some 
of  these  columns  are  zero,  it  is  possible  to  suppress  them 
for  arithmetic  purposes.  For  computational  simplicity,  the 
matrix  actually  used  by  the  computer  program  deletes  all  the 
zero  columns  and  assumes  a  corresponding  "r eparameterized"  b 
vector  of  lower  dimension.   For  mathematical  rigor,  however, 

the   X    used   in   the   following   sections  retains  the  same 

number  of  columns  as  X.   This  restriction  will  be  eased  when 

the   matrix   X   is  actually  applied  to  the  computer  programs 

in  Appendix  B. 


ESTIMABILITY  IN  THE  CONSTRAINED  MODEL 


It  can  be  shown  that  estimability  in  the  constrained 
model  Y  =  X  b  +  e  follows  the  same  pattern  as  estimability 
in  the  full  model  Y  =  Xb  +  e. 


Theorem  5.   g'b  is  estimable  if  and  only  if  g'H   =  g' 
Proof:  By  definition,  g'b  is  estimable  if 

g'b  =  t'E  (Y)  ,  i.e. ,  if 
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q'b  =  t'X  b,  i.e.,  if 

g«   =  t'X  .  Then 

*  *  *  #   * 
q'H   =  t'X  G  X  »X  , 

*  * 

q'H   =  t'X   =  q« , 


and  if         q'  =  q'H  , 


q'  =  q'G  X  'X  .   Let  t'  =  q'G  X  ' 


Then  q'  =  t'X  . 


This  result  allows  computations  to  be  performed 
directly  upon  the  constrained  matrix  in  order  to  examine  the 
estimability  of  proposed  hypotheses.  The  computer  program 
HYTEST  (Appendix  A)  can  accept  either  the  constrained  design 

matrix   X    (with  "zero"  columns  suppressed)  or  the  standard 

design  matrix  X  as  an  input.   If  X  is  used  for  input,  HYTEST 

offers   the  option  of  using  either  the  constrained  matrix  X 

or  the  standard  matrix  X  to  compute  tests  of  estimability. 


Note   that   q'H  b   is   always   estimable   since   q'H  b  = 
q'G  X  'X  b  =  q'G  X  '  E  (Y)  =  t'E(Y)  where  t«  =  q'G  X  '. 


H.   TESTABILITY 


1 .   the  hypothesis 

From   Searle  [1],   all   linear   hypotheses   can   be 
handled  by  a  general  procedure;  specific  hypotheses  are  then 
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considered  to  be  applications  of  this  general  procedure. 


The   general   hypothesis   may  be  written  H  :  K'b  =  m 

0 

where  K'  is  a  matrix  of  s   rows   and   p   columns.   The   only 

limitation  on  K1  is  that  it  have  full  row  rank.  That  is,  the 
hypothesis  must  be  composed  of  linearly  independent 
functions  of  the  parameter  vector. 


2 •   analysis  of  variance 

To  review  analysis  of  variance  briefly,  classical 
techniques  rely  upon  the  ratio  of  two  independent  Chi-square 
distributions,  each  divided  by  its  respective  degrees  of 
freedom,  to  generate  an  F  statistic.  The  sum  of  squares 
explained  by  the  model  if  the  hypothesis  is  assumed  true, 
divided  by  its  deqrees  of  freedom  forms  the  numerator  cf  the 
statistic.  For  many  situations,  the  denominator  is  the  sum 
of  squares  for  error  divided  by  its  deqrees  of  freedom. 
Each  sum  of  squares  can  be  conveniently  represented  by 
appropriate  quadratic  forms  which  must  meet  certain 
requirements  in  order  to  be  Chi-square  distributed. 

Searle's  derivation  of  a  test  of  the  general 
hypothesis   depends   upon   K'b  being  estimable  for  every  row 

k  • b.  If  this  assumption  is  satisfied,  the  quadratic  form 
i 

2        0  -10        2 

Q/v   =  (K'b   -  m)'(K'GK)    (K'b   -  m) /v 

is      distributed      non-central    Chi-square   and    has   rank   s.      The 

sum  of  squares  for  error  can  be  shown  to  be 

-1  -1 

SSE  =  (Y-XK(K'K)   m)  '  (I-XGX*  )  (Y-XK  (K'K)   m)  . 

Q    and   SSE   are   independent   so   F(H)  =  Q/s/SSE/  (n-  r)   is 

distributed  non-central 
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-1  2 

F»[s,n-r,  (K'b-m)  ■  (K»GK)   (K'b-m)/2v  ] 

The  test  statistic  is 

0  -10 

F(H)  =  (K'b   -  ffl)'(K'GK)    (K'b   -  m)  /s/SSE/  (n-r) 

which  is  F  distributed  with  s  and   n-r   degrees   of   freedom 

under  the  null  hypothesis. 


3 .   det er m inat io n  of  testability 


Suppose    that    K'b   is   not   estimable.   Then   the 

hypothesis   H  :  Krb  =  m   is   not   testable.   Assuming    that 
0 

-1 
(K'GK)      exists,    if   one   were   to   compute   F(K)  ,   what 

hypothesis  is  actually  being  tested?  When  working   with   trie 

constrained    linear    model    Y  =  X  b  +  e,   the   answer   is 

"K'H  b  =  m."   The  derivation  which  follows  closely  parallels 

the   procedure   used   by   Searle   [1]   for   the  linear  model 
Y  =  Xb  +  e. 


The  hypothesis  H:  K'H  b   =  m  is  testable  since  K'H  b 

is  estimable  for  each  row   k  '.   The   appropriate   numerator 

i 

quadratic  form  is 

♦  0  ♦  ♦  ♦     —1      *  0 

Q   =  (K'H  b   -  m)  •  (K'H  G  H  ' K)     (K'H  b   -  m) 

*  0       ***** 
But  K'H  b   =  K'G  X  'X  G  X  «Y  and  since 

^c   ^c  $c  ^c   ^c     & 

X  G  X  '=X  G  'X  ' ,  it  follows 

*  o      ***** 
K'Hb   =K'GX'XG»X'Y. 

*     *  *   *  * 

Let   G   =  G  X  'X  G  '  (which  is  a  generalized  inverse  of 
1 


22 


*   * 
X  'X  ) .   Then 

*  0        *  * 
K'H  b   =  K'  G   X  '  Y 
1 

*  0        0 
K'H  b   =  K»  b 
1 

0       *  *  *   *  0       * 

where  b     =  G   X  ■ Y  is   the   solution   to   X  » X  b    =   X  'Y 
1       1 

* 
obtained  from  using  G 

1 


Also 


*  *    *  ******* 
K'HGH'K=K'GX'XGX'XG'K 

*  *    *  *    *       *    * 
K'HGH'K=K'GX'XG'K 

*  *    *  * 
K'HGH'K=K'G       K 

1 

Therefore   Q      reduces    to 
1 

0  *      -1  0 

Q      =     (K'b         -    m)  '  (K'G      K)        (K'b         -    m) 
11  11 

Which   is   the   quadratic   form   that   would    result    from 

attempting  to  test  the  non-testable  hypothesis  K'b  =  m  using 

0      *  * 
the    solution    b    =  G   X  'Y.    These    calculations    are 

1      1 

indistinguishable      from      those      that      would      be  performed    in 


testing    the    testable   hypothesis    K'H   b    =   m. 
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IV.   SYNOPSIS  OF  RESULTS 


CONCLUSIONS 


1.  A  given  linear  function  of  the  parameter  vector  (q'b) 
is  estimable  if  and  only  if  q'H  =  g'. 

2.  Since  only  estimable  functions  are  testable,  if 
q'H  ±    g'  then  the  hypothesis  actually  being   tested   is   not 

H  :  q'b  =  m  but  rather  H  :  q ' Hb  =  m. 
0  0 

3.  The  mathematics  developed  for  proving  both  of  the 
preceding  conclusions  can  also  be  applied  to  a  constrained 
design  matrix,  such  as  that  used  in  the  3MD05V  program,  to 
allow  determination  of  estimability  directly.  That  is, 

■  the  function  q'b  is  estimable   if   and   only   if 

q'H   =  q ' ,  and 

■  if  q'b  is   not   estimable   then   the   hypothesis 


H  :  q'b  =  m ■ is  actually  testing  H  :  q'H  b  =  m. 
0  0 


B.   SIGNIFICANCE 


The   results   presented   above   afford   a    mathematical 
justification   of   the   need   for   a   computer   program   to 
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determine  the  testability  of  proposed  hypotheses.  A  sample 
program,  HYTEST,  is  presented  in  Appendix  D.  Using  the 
HYTEST  program  permits  an  analyst  to  determine  not  only 
whether  each  of  his  hypotheses  is  testable,  but  also 
precisely  what  main  effects  and  interactions  confound  each 
hypothesis  that  is  not  testable.  Such  information  can  be 
used  to  make  more  informed  decisions,  a  priori,  on  the 
design  of  experiments,  and  a  posteriori  on  the  analysis  and 
interpretation  of  experimental  results.  Knowing  the  nature 
and  degree  of  confounding  may  not  necessarily  ease  decision 
making.  It  can  help  to  ensure  that  once  a  hypothesis  is 
accepted  or  rejected  the  analyst  is  aware  of  the  degree  of 
"purity"  of  his  conclusions  concerning  the  effects  of 
various  factors  in  the  experiment.  If  used,  these  techniques 
can  prevent  an  analyst  from  complacently  assuming  that  he  is 


testing    one   hypothesis   when   he 
something  quite  different. 


is 


in 


rac 


testing 
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APPENDIX  A 


HYTES' 


HYTEST  is  a  FORTRAN  IV  program  that  facilitates 
determining  if  a  proposed  hypothesis  is  testable.  The  User's 
Guide  contains  operating  instructions  and  a  list  of  options. 

FLOWCHART 


INPUT  PROBLM  DATA 


DETERMINE  TYPE  OF 
INPUT  MATRIX 


READ  STANDARD 
DESIGN  MATRIX 


I 


INPUT  EACH  AUXILIARY 
EQUATION  IN  TURN  AND 
DO  BASIC  ROW  OPERATIONS 


I 


— f 


OUTPUT  BMD05V  DESIGN  CARDS 


T 


READ  BICMED 
MATRIX 


5 
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© ► 


i 


COMPUTE  GENERALIZED 
INVERSE  G 


I 


CHECK  ACCURACY  OF 
COEFFICIENTS  BY 
X»X  -  X'XG'X 


© K 


DETERMINE  IF 
ANY  HYPOTHESES 
ARE  TO  BE  TESTED 


YES 


NO 


JBL 


INPUT  HYPOTHESIS 


T 


ENSURE    Q!GQ    IS 
NON-SINGULAR 


COMPUTE  Q' H 


IF  Q»H  =  Q« 


YES 


NO 


HYPOTHESIS  IS 
TESTA3LE 


STOP 


CANNOT  TEST 
HYPOTHESIS 


21 


SUBROUTINE 

In  the  interest  of  computational  accuracy  and  speed, 
HYTEST  uses  subroutine  LPSDOR  from  the  International 
Mathematical  and  Statistical  Library  (IMSL)  to  compute  a 
generalized  inverse  G.  This  is  a  proprietary  subroutine. 
Under  the  licensing  agreement,  its  code  may  not  be 
distributed  to  or  used  by  a  user  outside  the  Naval 
Postgraduate  School.  It  may  not  be  used  on  a  non-NPS 
computer  system. 
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APPENDIX  B 


USER'S  GUIDE 


This   guide  contains  complete  operating  instructions  for 
using  the  computer  program  HYTEST. 


the   constrained   design  matrix  X   actually  used  in  the 

computer   programs   3MD05V   and   HYTEST   is   the    same    X 

introduced  in  Chapter  III  except  that  all  zero  columns  have 
been  deleted  to  facilitate  computation.  This  has  the  effect 
of  reducing  the  parameter  vector  b;  an  analyst  using  these 
programs  is  cautioned  to  ensure  that  he  is  aware  of  which 
elements  have  been  "deleted"  from  the  parameter  vector.  See 
Chapter  III,  sections  D,  E  and  F,  for  further  information. 


A.   OVERVIEW  OF  COMPUTER  PROGRAM  HYTEST 


The    program    HYTEST   exploits   technigues   enumerated 
earlier  in  this  paper  to  determine  if  selected  hypotheses  of 

the  form  H  :   k  •b  =  m   are  testable  within  the  framework  of 
0     i 

a  specified  linear  model.   If  an  hypothesis  is  not  testable, 

HYTEST   computes   an   algebraic  form  of  the  parameter  vector 

that  would  actually  be  tested   if  the   proposed   hypothesis 

were  to  be  employed. 
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Given  the  standard  design  matrix   (X)   from   the   linear 
model,   and   appropriate   additional  constraints,  HYTEST  can 

compute  the   design   matrix   (X  )   for   use   in   the   BMD05V 

program.  If  desired,  DESIGN  cards  for  use  in  the  BIOMED 
package  can  te  produced  as  an  auxiliary  '  output.  If  the 
BIOMED   option   is   exercised,   HYTEST   performs   all  of  its 

calculations  upon  X   making  it  essential  for  the  user  to   be 

aware  of  the  exact  structure  not  only  of  the  original  design 

matrix  X,  but  also  of  the  constrained  matrix   X  .   The   user 

can  select  the  option  to  have  all  calculations  performed  on 
X,  if  preferred. 


3.   INPUI  OPTIONS 


The  program  can  accept  either  the  standard  design  matrix 
from  the  basic  linear  model,  or  a  BMD05V  design  matrix. 


OUTPUT  OPTIONS 


1 •   hypotheses 

The  major  option  for  output  concerns  the  testability 
of  user  defined  hypotheses.  For  any  number  of  hypotheses 
from  zero  to  99,  HYTEST  will  determine  if  each  hypothesis  is 
testable,  and  if  not  testable,  the  program  computes  what 
would  actually  be  tested  if  the  specified  hypothesis  were  to 
be  employed.  This  option  is  automaticllly  suppresssed  if 
there  are  no  hypotheses  to  be  tested. 


30 


2 •   design  cards 

If  a  BICMED  design  matrix  is  computed,  the  user  can 
select  an  option  that  will  punch  appropriate  DESIGN  cards 
for  3MD05V;  a  printed  replica  of  the  cards  is  also  produced. 
If  punched  cards  are  not  required,  the  user  can  opt  for  the 
printed  form  of  the  DESIGN  cards  without  having  them 
punched.  This  output  option  is  only  available  if  a  standard 
design  matrix  X  is  used  for  input. 

3 .  accuracy 

An  estimate  of  the  accuracy  with  which  HYT2SI  is 
computing  the  generalized  inverse  matrix  G  can  be  obtained 
upon  request.  The  output  consists  of  the  matrix  resulting 
frcm  subtracting  X*X  -  X'XGX'X.  If  the  computer  were 
perfectly  accurate,  all  entries  would  be  zero.  Because  of 
arithmetic  inaccuracies  in  computing  G,  the  entries  are 
frequently  not  zero.  Since  the  matrix  H=GX'X  is  used  in 
assessing  the  testability  of  hypotheses,  X'X  -X'XGX'X 
affords  an  estimate  of  the  accuracy  to  be  expected  when 
determining  the  nature  of  the  hypothesis  being  tested. 

4 .  H  =  G X^X 

The  H  matrix  used  in  determining  the  testability  of 
selected  hypotheses  will  be  printed  if  requested.  Selecting 
this  option  will  generate  printed  output  for  all  of  the 
previous  options  as  well. 


31 


5 •   generalized  inverse  G 

The  final  option  prints  out  the  generalized  inverse 
of  X'X  used  in  computations  within  HYTEST.  This  option 
automatically  includes  printed  output  for  all  previous 
options  as  well. 


D.   INPUT  REQUIREMENTS 


1 .   card  order 

Input  cards  must  be  in  the  following  order: 

PR03LM  card  (required) 

Design  matrix  cards  (required) 

AUXEQN  card  (required  if  standard 

design  matrix  input  option  is  used) 
Auxiliary  equations  (optional) 
Hypotheses  (optional) 
FINISH  card  (required  after  last  problem) 

2  .   problem  card  _£r equir ed]_ 

The  HYTEST  PROBLM  card,  based  upon  a  similar  card 
used  in  the  EMD05V  program,  is  used  to  set  up  various 
program  parameters  and  options. 

DATA  COLUMNS      RESTRICTIONS 

PRCELM  1-6 

User's  optional  problem  number        7-8 
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Number  of  design  card  sets  9-11 

Number  of  columns  in  design  matrix    12-13 

Blank  14-15 

Number  of  hypotheses  16-17 

Blank  18-26 

Output  options  27 

Blank  28-71 

Input  options  72 


1  <  ND  <  150 

1  <  NC  <  60 

1  <  NH  <  99 


0=  X  matrix 


1=  X   matrix 


Output   options   are 
table. 

IF  COLUMN  27  CONTAINS 
0 


selected   from   the   following 

THE  PRINTED  OUTPUT  WILL  INCLUDE 

A.  An  assessment  of  what  sach 
hypothesis  is  testing 

B.  Design  cards  (if  the  3I0MED 
output  option  is  being  used) 
plus  option  0 

C.  Accuracy  of  coefficients  and 
cption  0 

D.  The  a  matrix  (H=GX'X); 

plus  all  output  from  options 
0  and  1 

E.  The  generalized  inverse  G 
plus  all  previous  options. 


3  •   <L§§if2I}  matrix  cards  (reguired) 
a.  standard  design  matrix 

One  design  card  is  reguired  for  each  unique  row  of 
the  design  matrix.  The  first  2  columns  of  card  i  contain 
(right  justified)  the  number  of  rows  in  the  design  matrix 
that  are  identical  to  row  i.  The  next  60  columns  (3-62)  are 
reserved  for  the  columns  of  the  design  matrix.   Enter  a  zero 
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or  one  in  the  appropriate  card  column.  Each  card  column 
corresponds  to  a  column  in  the  design  matrix  (not  to  exceed 
a  total  of  60  columns) . 

example 
If  a  hypothetical  design  matrix  were: 

110   0 

110   0 

10   10 

10   10 

10   10 

10   0   1 
Only  three  design  cards  are  required: 

021 100    (in  columns  1-6) 

031010 

011001 
b.  3MD05V  design  matrix 

Enter  the  BIOMED  design  matrix  (without  data  cards) 
exactly  as  it  is  used  in  the  3MD05V  package. 


4 .   AUX EQ.N  ca rd  (required  if  the  standa rd  design  matrix 
iB.EM£  i§  used) 


DATA 
AUXEQN 

Number  of  auxiliary  equations 
Punched  card  output  option 


COLUMNS 
1-6 
7-8 
9-1  1 


RESTRICTIONS 


0  <  NA  <  99 
1=Design  cards 

Punched 
0=Design  cards 
Not  punched 
Caution:  Do  not  use  this  card  if  the  BIOMED  input  option   is 
exercised. 


If  there  are  no  auxiliary  equations,  the  program 
will  perform  its  computations  on  the  standard  design  matrix. 
None  of  the  EIOMED  output  options  are  then  available. 
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5  •   Auxiliairz  ec[uat  ions  (optional) 

Auxiliary  equations   are   algebraic   equivalents   of 

BMD05V   constraints.    For   the   example   to  be  used  in  this 

section  assume  the  linear  model 

Y    =u+a+e      i=1 ,2,3 
i.j        i    ij 

Each   auxiliary   equation   (m)   requires   a  separate 

series  of  input  cards.   The  first  card  contains   the   number 

of   parameters   in  equation  m.   As  an  example,  the  auxiliary 

equation 

a   +  a   +  a   =0 
1     2     3 

contains   three   parameters   so   the   first   card   for   this 

eguation  would  contain  03  in  the  first  two  columns. 

The   second   card  contains  the  column  number  and  the 

coefficient  of  the  parameter  whose  column  is  to   be   deleted 

from   the  BMD05V  matrix.   For  instance,  the  parameter  vector 

in  this  example  is 

b'  =  (u,  a  ,  a  ,  a  ) 
12    3 

Parameter   a   corresponds  to  the  fourth  column  of  the  design 

matrix  X.   In  order  to  eliminate  a   from  the  BMD05V   matrix, 

the  second  card  must  contain: 

0401 
in  the  first  four  columns. 

A  separate  card  is  required  for  each  parameter  in 
the  auxiliary  equation,  so  this  example  needs  two  more 
cards : 

0  20  1 

0301 
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corresponding   to  the  columns  and  coefficients  for  a   and  a 

12 

respectively . 


Caution:  if  a  parameter's  column  is  to  be  deleted 
from  the  EMD05V  design  matrix,  that  parameter  must  not  be 
used  in  an  auxiliary  equation;  its  algebraic  equivalent  must 
be  used  instead.  A  complete  set  of  cards  must  be  included 
for  each  separate  auxiliary  equation  (m  cards  for  each 
auxiliary  equation  with  m  parameters) . 

The   above   input   will  produce  the  following  BMD05V 
design  cards: 
DESIGN002001001000 
DESIGN003001000001 

DESIGNOO 10010-10-1 

6  •   Hypothesis  cards  l2£t ional]_ 

The   format   for  hypothesis  cards  is  similar  to  that 

for  the  auxiliary  equation  cards.  Assume  that  the  hypothesis 

of  interest  is: 

H  :  a   -  a   =0 
0    1     2 

There  are  two  parameters  in  the  hypothesis   (a   and 

a  )   corresponding   to   columns   2   and  3  in  the  constrained 
2 

BMD05V  matrix.   It  is  important  to  note  that  the   parameters 

used   in   hypothesis   testing   must   be   associated  with  the 

matrix  used  to  compute   H  =  GX'X.   If   the   standard   design 

matrix    X   is   used   throughout   the   program,   the   column 

corresponding  to  a  specific  parameter  is  unchanged  from   the 

original   model   formulation.    If,   on   the  other  hand,  the 

standard  design  matrix  X  is  used  for   input,   and   auxiliary 
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eguations  are  used  to  reduce  X  to  the  BMD05V  matrix  X  ,  then 

the  columns   used   in   the   hypothesis   cards   must   be   the 

* 

appropriate   columns   from   X  .    The   first  card,  for  this 

example  hypothesis,  contains  02  (for  two  parameters)  in   the 
first  two  columns.   The  second  and  third  cards  contain 

021  .0 

03-1.0 
respectively.  The  first  two  columns  identify  the 
parameter's  column  in  the  appropriate  design  matrix.  The 
parameter's  coefficient,  a  decimal  point  and,  if 
appropriate,  a  minus  sign,  must  be  punched  in  columns  3 
through  11. 


7  •   FINISH  card  Ijte^uir ed  after  last  problem) 

If  several  problems  are  to  be  run  in  seguence,  the 
cards  for  each  problem  are  to  be  grouped  in  seguential 
blocks.  Once  HYTEST  finishes  a  problem  it  determines  if 
another  problem  is  to  be  run.  If  so,  it  executes  that 
problem.  It  will  continue  executing  problems  in  sequence 
until  all  problems  are  completed  and  a  FINISH  card  is 
encountered.  The  FINISH  card  must  be  the  last  card  in  the 
data  deck  and  the  word  FINISH  must  be  punched  in  the  first 
six  columns. 
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APPENDIX  C 


EXAMPLE 


A.   THE  MODEL 


The  use  of  HYTEST  can  best  be  illustrated  through  use  of 
an  example.   The  one  chosen  is  from  Searle  [ 1  ]. 


The  mods!  is: 


y  =m+A+B+A3         +e 

ijk  i  j  ij  ijk 


i=1,2,3 


ijk 


j  =  1/.  ..,<» 
k=1,.  .. ,4 

is  the  kth  observation  of  the  ith  treatment  of  the  jth 


type;  M  is  the  mean  effect;  A   is   the   effect   of   the   ith 

i 

treatment;   E    is   the   effect  of  the  jth  type;  A3    is  the 

interaction  between  the  ith  treatment  and  the  jth   type   and 

e     is   the   error  term.   The  number  of  observations  noted 
ijk 

for  each  cell  is  shown  in  the  following  table: 


j=1 

j  =  2 

j=3 

j  =  4 

i=1 

3 

0 

1 

2 

i=2 

2 

2 

0 

0 

i=3 

0 

2 

2 

4 
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The  X  matrix 

is: 

1   2 

3 

a 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

1   0 

1 

0 

1 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

1   0 

1 

0 

1 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

1   0 

1 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

1   0 

1 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

1   0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

1   0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

1   0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1  o 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1   0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1   0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1   0 

0 

0 

0 

0 

1 

a 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1   0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

Suppressing   the 


zero  columns  for  AB   . 

12 


AB  , 
23 


AB   , 
24 


and 


AE    the  input  design  matrix  for  HYTEST  is 
31 

031100100010000000 

011 100001001000000 

021 100000100100000 

021010100000010000 

021010010000001000 

021001010000000100 

021001001000000010 

041001000100000001 


39 


B.   CARD  PREPARATION 


The  size  of  the  BMD05V  matrix  is  controlled  by  the  type 
of  constraints  used  in  the  BIOMED  package.  In  general,  the 
number  of  parameters  for  each  main  effect  and  each 
interaction  effect  is  reduced  by  1  in  each  dimension.  In 
this  example,  1  parameter  is  deleted  for  each  of  the  main 
effects,  A  and  B.  Since  i  =  1,2,3  and  j  =  1,2,3,4  there  are 
3  x  4  =  12  interaction  terms.  Reducing  each  dimension  by  1 
yields  i  =  1,2  and  j  =  1,2,3  for  2x3=6  remaining 
interaction  terms.  As  noted  in  the  preparation  of  the  input 
design  matrix  above,  the  columns  for  four  interaction  terras 
are  zero.  Those*  four  terms  are  therefore  deleted  from  the 
model.  Two  other  interaction  and  two  main  effect  terms  can 
be  deleted  by  addition  of  the  usual  BMD05V  constraints.  For 
this  example,  the. following  constraints  were  adopted: 

(1)  A       +    A       +    A       =    0 

12  3 

(2)  E+B+3+3=0 

12  3  4 

(3)  AB         +    AB         +    AB         +    AB  =0 

11  12  13  14 

(4)  AB  +    AB  +     A3         +    A3  =0 

31  32  33  34 

In   auxiliary   equation   (1)   it   is   desired   to  delete 

parameter  A   which  corresponds  to  column   4   of   the   design 
3 

matrix.    Since   there  are  three  parameters  in  this  equation 

the  first  card  is: 

03 
The  other  cards  are: 

0401 

0  20  1 

0301 
This  completes  the  cards  for  auxiliary  equation  (1). 
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Auxiliary  equation  (2)  is  quite  similar.  If  B   is  chosen 

for  deletion  the  cards  are: 

04 

0801 
0701 
0601 
0501 


Auxiliary  equation  (3)  contains  the  parameter  AB    which 

12 

does   not   appear   in   the   design  matrix  (it  is  a  column  of 

0's).   Under  the  BMD05V  constraints,   it   has   an   algebraic 

equivalent.   AB     =  -AB    -  AB   ,  which  must  be  used  in  its 

12        22      32 

place.  Equation  (3)  then  becomes 

(3a)  AB    -  AB    -  A3    +  AB    +  AB    =0. 

11      22      32      13      14 

The  inpur  cards  used  to  delete  AB    from  the   design   matrix 

14 

are: 

05 

1101 

13-1 

14-1 

1001 

091  1 


The  interaction  term  A3    from  equation   (4).   does   not 

31 

appear   in   the   design  matrix.  Substituting  AB    =  -  A3 

31        11 

AB    into  equation  (4)  yields: 
2  1 


(4a) 


-AB    -  AB    +  AB    +  AB    +  AB     =  0. 
11      21      32      33       34 


The  input  cards  used  to  delete  AB    from  the   design   matrix 

34 

are: 
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05 

1601 

09-1 

12-1 

1401 

1501 

Deleting  a  total   of   eight   parameters   from   the   standard 

design   matrix  will  yield  a  BHD05V  matrix  of  twelve  non-zero 

columns. 

123456    7     8     9    10    11    12 

M  A   A   E   B   B   AB    AB    AB    AB    AB    AB 
12   12   3    11    13    21    22    32    33 


C.   HYPOTHESES 


To   verify   that   A   -  A   +  AB    -  AB    is  estimable  but 

12      11      21 

that  A   -  A   is  not,  the  following  cards  are  used: 
1     2 

04 

021  .0 

0  3-1.0 

071  .0 

0  9-1.0 

02 

021  .0 

03-1.0 


After  completing  the  PROBLM  card,  the  AUXEQN  card  and 
the  FINISH  card  as  outlined  in  the  User's  Guide  and  putting 
the  cards  in  the  correct  order,  one  is  ready  to  run  HYIEST. 

The  outputs  of  major  interest  are  the  DESIGN  cards  and 
the  hypotheses.  In  the  interest  of  brevity,  only  those  two 
outputs  will  be  illustrated. 
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D.   SAMPLE  OUTPUT 


Design  cards  suitable  for  BMD05V  will  be  similar  tc   the 
sample  shewn  below. 


DESIGN 

3   1 

I   1 

0 

1 

0 

0 

1 

0 

0 

0 

0 

0 

DESIGN 

1 

I   1 

0 

0 

0 

1 

0 

1 

0 

0 

0 

0 

DESIGN 

2   ' 

I   1 

0 

-1 

-1 

-1 

-1 

-1 

0 

1 

1 

0 

DESIGN 

2   ' 

I   0 

1 

0 

0 

0 

0 

1 

0 

0 

0 

DESIGN 

2 

I   0 

0 

1 

0 

0 

0 

0 

1 

0 

0 

DESIGN 

2 

1  -1 

- 1 

0 

1 

0 

0 

0 

0 

0 

1 

0 

DESIGN 

2   ' 

I  -1 

-1 

0 

0 

1 

0 

0 

0 

0 

0 

1 

DESIGN 

4 

1  -1 

-  "] 

- 1 

-1 

-1 

1 

0 

1 

0 

-1 

-1 

The  column  on  the  left  below  represents  the  vector  q1  in 
the  hypothesis  H:  g'b  =  m.  The  column  on  the  right  below 
represents  the  vector  q'h.  If  g'  =  q'H,  the  hypothesis  is 
testable.  If  g*  *  q'H,  the  hypothesis  is  not  testable. 


THE  HYPOTHESIS 
OF  INTEREST  IS 
(HO:  Q'B  =  M) 

0.0 

1.0 
-1  .0 

0.0 

0.0 

0.0 

0.0 

0.0 


THE  HYPOTHESIS  WHICH  IS 
ACTUALLY  BEING  TESTED 
(HO:  Q'HB  =  M) 
0.  115 
0.748 
-0.656 
-0.084 
-0.298 
0.038 
0.221 
0.099 
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0.0 

0.0 
0.0 

0.0 


-0.374 

-0. 160 

0.275 

-0.061 


WITHIN  THE  LIMITS  OF  COMPUTATIONAL  ACCURACY, 
HYPOTHESIS  1  CANNOT  BE  TESTED  AS  STATED  ABOVE 


THE  HYPOTHESIS 
OF  INTEREST  IS 
(HO:  Q'B  =  M) 

0.0 

1.0 
-1  .0 

0.0 

0.0 

0.0 

1  .0 

0.0 
-1  .0 

0.0 

0.0 

0.0 

0.0 


THE  HYPOTHESIS  WHICH  IS 
ACTUALLY  BEING  TESTED 
(HO:  Q'HB  =  M) 

0.0 

1  .0 
-1.0 

0.0 

0.0 

0.0 

1  .0 

0.0 
-1  .0 

0.0 

0.0 

0.0 

0.0 


WITHIN  THE  LIMITS  OF  COMPUTATIONAL  ACCURACY, 
HYPOTHESIS  2  CAN  BE  TESTED  AS  STATED  ABOVE 

As    expected    (Searle   [1])/    one    hypothesis  is      estimable; 

the        other        is      not.         Additional      output  information      is 

available      from      HYTEST      if      desired.         The  User's        Guide 
enumerates    all  of    the   options. 
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APPENDIX  D 


COMPUTER  PROGRAM 


REAL*8  £P,BC,BF 

DIMENSION  X(150,6C),NREPS(150)  ,XS( 150,60) 

CIMENSICN  XR(60) , I D( 60 ) ,XSX( 60 , 60 ) , tfKAREA { 3640 J ,Q(60) 

CIMENSICN  GH(60),  H(60,60) 
CATA  BP/'PROBLM   ■  /  ,B  F/  »F  INISH   */ 
100C   FCPMAT  ( '0' , 'PROBLEM  NUMBER', 14) 

1001  FCRMAT  CO', 'NUMBER  OF  DESIGN  CARC  SETS',16) 

1002  FCRMAT  ('  0 «,' NUMBER  CF  INDEPENDENT  VARIABLES  IN  DESIGN 
1  N AT R  IX*  ,15) 

1002   FCFMAT  CO', 'NUMBER  OF  INDEPENDENT  VARIABLES  IN  8MC05V 

1  MATRIX'  ,  15) 
101C   FCFMAT  («0«, 'WITHIN  TFE  LIMITS  CF  CCMPUTAT I CNAL  ACCURA 

1CV,  '  ) 

1011  FCRNAT  ('  ',' HYPOTHESIS',  13, 2X,  'CAN  3E  TESTED  AS  STATE 
ID  ABOVE'  ) 

1012  FCRNAT  ('  •  , ' riYPOTHES  I  S ' , I  3 , 2X  ,  •  CANNOT  BE  TESTED  AS  ST 
1ATEC  ASCVE'  ) 

102c   FCPMAT  ( '0'  ,'GENERALI  ZED  INVERSE  MATRIX,  C- '  /  ) 
1027   FCFMAT  CO','H  MATRIX  (GX"X)'/) 
103C   FCFNAT  ('  «,25F5.1) 

1035  FCRMAT  (•  0 ','  ACCURACY  CF  COEFFICIENTS  (X"X  -  X"XGX"X)' 
1/  ) 

1036  FCFMAT  (•  '  ,  10 ( 3X  ,  F10  .  5  )  ) 

1037  FCRMAT  CO') 

1040  FCFMAT  CO*, 'INPUT  OPTICN:  STANCARC  DESIGN  MATRIX') 

1041  FCRMAT  CO', 'INPUT  CPTICN:  EMD05V  CESIGN  MATRIX') 

1050  FCRMAT  ( A6  ,  I  2, 13 ,  12, 2X , I  2 , 8X, I  2  ,44X , 1 1 ) 

1051  FCPMAT  CO', 'FINISH  CARD  MISSING  CR  NEXT  PRCBLM  CARC  C 
1LT  CF  SEQUENCE.  PROGRAM  CANNOT  CONTINUE  AND  IS  TERMNA 
2TING.' ) 

1060  FCPMAT  CO', 'THE  FY POTHES IS • , 10X  ,  •  THE  HYPOTHESIS  WHICH 
IIS') 

1061  FCPMAT  (•  ',5X,F7.3,18X,F7.3  ) 

1062  FCRMAT  (•  • , • OF  INTEREST  I S ', 10X ,' ACTUAL LY  EEING  TESTE 
1C  ) 

1063  FCRMAT  (•  • , • ( HO :   Q"B  =  M ) • , 10X  ,  •  (HO :   Q"FE  =  M)'//) 
1066   FCRMAT  ( • 0 « , ' HYPOTHES  I  S ' , 13  , '  CANNCT  BE  TESTED  BECAUSE 

1  C"GQ  IS  SINGULAR'  ) 
107C   FCFMAT  (6X , 12 , 13 , 12 , 2X , 12 , 8X , I  2 ,44X , 1 1 ) 

1071  FCPMaT  (I2,6CF1.0> 

1072  FCFMAT  (6X , I  3 ,21 F3  .0/6X  ,22F3 .0 ) 
1075   FCFMAT  (I2,30F2.0) 

1C76   FCRMAT  {•  •  ,  • DE S  IGN'  , I  3 ,211 3  /  7X,22I3) 

1077  FCRMAT  CO', 'DESIGN  CARDS'/) 

1078  FCRMAT  (• DES IGN'  ,  I  3 ,21 13/ • DES I GN ',  2213/ » CES  IGN ', 22  13  ) 

1079  FCRMAT  (I2,F9.0) 

1C9C   FCRMAT  CO', 'CESIGN  CARD  NUMBER ',  14, 2X,  •  IS  INVALID') 
C  READ  IN  THE  OESIGN  MATRIX  AND  THE  PRC2LEM  CAFC 

REAC  (5,1070)  NP,ND,N,NH, IPR,IO 
227C   WRITE  (6,1000)  NP 

WRITE  (6,1001)  NO 

WFITE  (6,1002)  N 

IF  (10  .EG.  0)  GO  TO  2305 
C  THIS  READS  IN  THE  BIOMED  DESIGN  MATRIX 

WFITE  (6,1041) 

DC  2300  1=1, ND 
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2300 


C  TF 

2305 


231C 
C  CC 


2313 

2315 

C  RE 
C  AN 


232C 


2323 


2324 
2325 
233C 
C  SU 


2335 


2336 

224C 
C  CI 


2355 


236C 
C  CC 
2365 


PE£ 
IF 

CCN 
NS  = 
GC 
IS  RE 
ViRI 
DC 

IF 
CCN 
NPITE 
PEA 
NS  = 
IF 
CC 
CC 
XS( 

ccn 

GC 
b«  RI 

AD  IN 
C  CC 
CC 
DC 
XB< 
CCN 
RE4 
NVA 
RE4 
XR< 
CC 
RE  A 
XR( 
CCN 
DC 
IF( 
CC 
X(I 
CCN 

ccn 

CCN 
PPRES 
jR  = 
CC 
CC 
IF 
CCN 
CC 
XS( 
CCN 
JP  = 
CCN 
TPUT 
IF 
VtRI 
CC 
CC 
IC( 
IF 
IF 
CCN 
WRI 
IF 
WRI 
CCN 
fPLTE 
DC 
DC 

xsx 

CC 
NFI 


C  (5tl0 

(NR5PS 

TINUE 

N 

TO  2365 

ADS  IN 

TE  (6,1 

2310  1= 

C  (5,10 

(NREPS 

TINUE 

THE  BM 
D  (5,10 
N-NA 
(NA 
2313 
2313 
I ,J)=X( 
TINUE 
TO  2365 
T5  (6,1 

EACH  A 
THE  EAS 
2330  IS 
2320  IR 
IR)=0.0 
TINUE 
C  (5,1C 
L  =  NV/aL- 
C  (5,10 
ICC  IS)  ) 
2323  IV 
(5,10 


•  NE 

1  = 
J  = 


NL)=XVA 

TINUE 

2325  IC 

X(  IC,  ID 

2324  JX 

C,JX)=X 

TINUE 

TINUE 

TINUE 

S  ZERO 

1 

2340  JT 

2335  JX 
(ID( JX) 
TINUE 

2336  IS 
IS,JR)= 
TINUE 
JR  +  1 
TINUE 
EMD05V 
(IPR 
TE  (6, 
2360 
2355 
K)=0 
(XS(  J 
(XS(  J 
TINUE 
TE  (6,1 
(IPCH  . 
TE  (7,1 
TINUE 

x'x=xs 

2380  1= 
2380  K= 
(I,K)=0 

2380  J= 
X=NREPS 


72)  NREPS(I),  (XS(I,J)  ,J  =  1,N) 
(I )  .LE.  0)  GC  TO  225C 


THE  STANDARD  CESIGN  MATRIX 

040) 

1,ND 

71)  NREPS  (I  ),  (X(I  ,JR),JR=1,N) 

(I )  .LE.  0)  GO  TO  2250 

C05V  MATRIX  (X*) 
70)  NA,IFCH 

.  0)  GO  TO  2315 

1,ND 

1,NS 

I, J) 


003)  NS 

UXILIARY  EQUATION  IN  TURN 

IC  ROW  OPERATIONS 

=  1,NA 

=  1,N 


75)  NVAL 
I 

75)  ID(IS),XVAL 

=  XVAL 

=1,NVAL 

75)  NL,XVAL 

L 


=  1,ND 
(IS))  .EC 
=  1,N 
(IC,JX)-XR( JX) 


0.0)  GO  TO  2325 


ROWS  AND  CCLUMNS 


=  1,N 
=  l,NA 
.EQ.  JT) 

=  1,ND 
X(  IS,JT) 


GO  TO  2340 


L 

1 

J  = 

K  = 

,K 
,K 


CESIGN  CARDS 

T.  1)  GC    TO  2365 

C77) 

i,ND 

1,NS 

)  .LE.  -0.5)  ID(K) 
)  .GE.   0.5)  ID(K) 


-1 

1 


076)  NREPS  (J) , ( IC(K)  ,K  =  1,NS) 

LT.  1)  GC  TO  2360 

078)  NREPS(J),  ( IC(K)  ,K=1,NS) 


It  NS 

1,NS 

.0 

1,ND 

(J) 
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238C 

C  TR 
C  IN 


229C 


2292 


2392 


C  CC 

2395 


2400 


240 


2402 


C  O 
2405 


2410 

2415 
2411: 

2420 


C  IN 
2422 


2422 


2425 


CC 
X<> 

CCN 
CAL 

ANSPC 

VEFSE 
CC 
CC 
X(J 
CCN 
IF 
WRI 
Nk  = 
NEN 
IF 
CC 
WRI 
CCN 
WRI 
N>  = 
IF 

PPUTE 
CC 
DC 
MI 
CC 
H(I 
CCN 
IF 
V>R  I 
N  In  = 
NEN 
IF 
CC 
WRI 
CCN 
kFI 
Nk  = 
IF 

ECK  F 
IF 
CC 
CC 
>S( 
CC 
XS( 
CCN 
CC 
DC 
XS( 
CCN 
WRI 
Nk  = 
NEN 
IF 
CC 
WRI 
CCN 
WPI 
NV  = 
IF 

PUT  T 
IF 
CC 
CC 
Q(I 
CCN 
REA 
CC 
REA 
C  (N 
CCN 


2380  J 
<I,K)= 

TINUE 
L  LPSD 
SE  TFE 

(  IN  X 
2390  J 
2290  I 
,1  )=  H 
TINUE 
(IPR  . 
TE  (6, 
1 

C=NW+9 
(NENC 
2393  I 
TE  (6, 
TINUE 
TE  (6, 
NW  +  10 
(NEND 

F  =  G 
2400  I 
2400  K 
,K)=0. 
2400  J 
,K)=X( 
TINUE 
(  IPR  . 
TE  (6, 
1 

C=NW+9 
(NENC 
2403  I 
TE  (6, 
TINUE 
TE  (6, 
N*v+lC 
(NENC 

BY  CG 
(IPR  . 
2410  I 
2410  K 
I,K)=0 
24i0  J 
I,K)=X 
TINUE 
2415  I 
2415  J 
If JJ=X 
TINUE 
TE  (6, 
1 

D=NW+9 
(NEND 
2420  I 
TE  (6, 
TINUE 
TE  (6t 
NW+1C 
(NENC 
FE  HYP 


(NH  .E 
2200  L 
2423  I 
)=0.C 
TINUE 
D  (5,1 
2425  I 
D  (5,1 
G)=HVA 
TINUE 


FIX=1,NFIX 

XS( J,I)*XS( J,K)+XSX( I ,K) 
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